IRREGULAR OPTICAL INTERCONNECTIONS TO COMPENSATE FOR 
NON-UNIFORMITIES IN ANALOG OPTICAL PROCESSORS 



FIELD OF THE INVENTION 

This invention relates generally to analog optical processors, such as matrix- 
vector optical processors, which perform analog computations and then convert the 
answer to digital, and then compensate for non-uniformities in such processors that result 
in computation errors. 

BACKGROUND OF THE INVENTION 

Most processing technology today relies on electricity. For example, in a typical 
semiconductor processor, electrons move within transistors for the processor to function. 
Such processors, when appropriately configured, are the current mainstay in digital 
technology, which represents information as discrete one or zero bits. However, optical 
processing technology has slowly been advancing. Optical processing technology relies 
on light with an adjunct of electronics. In optical processors, photons move within 
optical and electro-optical devices, causing the processor to function. Optical processors 
are usually analog technology, in which information can be represented on a continuous 
(i.e., grayscale) basis. 

One type of analog optical processor is a matrix- vector (M-V) processor, which 
computes the product of an input vector and a matrix. Such processors are useful in 
artificial intelligence (AI) technologies, such as neural networks. M-V multiplication for 
a vector c, equal to (c x ,...,c i9 ...,c N ) , is computed as shown in equation (1) for each i th - 
component of vector c. 
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c^Ja^j (1) 
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Components and bj can be represented in analog by two optical devices exhibiting 
grayscale. The components are analog in optical transmittance that is normalized to the 
maximum transmittance possible. For example, let ay and bj values be 0.5-grayscale for 
5 both devices with LO-grayscale as the normalized maximum transmittance. Light 

passing through both devices will have a value of 0.25-grayscale. For the calculation of 
two numbers, however, each device could be representing a number from 0 to 100. Thus, 
the output of both devices represents numbers from 0 to 10,000 (i.e., 0 2 to 100 2 ). 
Because each device is at 0.5-grayscale, each device represents the number 50 (0.5 times 

10 100). The output of both devices is representing the number 2,500 (0.25 times 10,000). 
Thus, the output has performed a multiplication, because 50 times 50 is 2,500. 

Thus, when light passes through both devices representing the ay and bj values, 
the transmittance of the light is the product, or aybj. The summation of the aybj values for 
Cj is accomplished by combining the relevant individual light beams with a cylindrical 

15 lens, and imaging the beams on the i th -detector of a detector array. N light beams for 
each of the N components of vector c are imaged in parallel on the detector array. Each 
detector in the array has an analog response to the light beams, so that each detector has a 
component of vector c in an analog form. The components are then converted to digital 
by an analog-to-digital (A/D) converter. 

20 A limitation in using analog optical M-V processors, and other types of analog 

optical multiplication processors, is that non-uniformities in the optical devices 
representing the components to be multiplied cause errors in the resulting product. This 
limits the bit-width of these processors. The non-uniformities make it difficult to 
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accurately represent the values for the components ay and bj in grayscale, because the 
grayscale responses for each component commonly differ from each other. A single A/D 
converter is used to convert the analog to the digital in the optical M-V processor. The 
A/D converter is designed by the characteristics of all of the optical devices in the 
processor. Non-uniformity implies a range of different characteristics. Thus, an A/D 
converter cannot be well design to accurately make the numerical conversions from 
grayscale. Since the uniformities, and lack thereof, determine the bit-width of the 
answers generated by analog optical processors, compensating for non-uniformities is 
important to achieving a high bit-width, which is necessary to have high-precision optical 
processors. 

The grayscale response for the components ay and bj are specifically not entirely 
accurate. Rather, they represent the quantities ay + Aay and bj + Abj, where Aay and Abj 
are error values resulting from the non-uniformities of the optical devices. An M-V 
processor therefore computes the i th -component in vector c as shown in equation (2). 

Zi = f>, + Aa^bj + Abj) = f>,Z>, + a.Abj + Aa,b } + Aa,A6 y (2) 
The error for a component of the vector c, Ac i? is given by equation (3). 

N N 

Zi - c t = Ac, = Yu a ifi b J + Aa A + Aa o Ab J = H Ac v (3) 

As an example, the j th -error for Ac i? Acy, can be determined by assuming that the 
two components for ay and bj exhibit the following grayscale characteristics. The ay's off 
state is 0.02-grayscale and on state is 0.95-grayscale. The bj's off state is 0.03-grayscale 
and on state is 1.05-grayscale. Using the numerical numbers from the earlier example, 
the input for a y and bj is from 0 to 1000. If the number 50 is input to both ay and b j? the ay 



component has a grayscale response of 0.485 that represents the number 48.5, and the bj 
component has a grayscale response of 0.54 that represents the number 54, assuming 
linearity and ideal control. Therefore, the ay grayscale error, Aa lj? is -1.5, and the bj 
grayscale error, Abj, is 4. Using equation (3), ACy = 119, which is also (48.5 x 54) - (50 x 
50) = 2619 - 2500. Thus, the corresponding numerical error is 1 19. 

Therefore, this error must be subtracted in order for the M-V processor to 
correctly multiply input vectors with matrices. One approach is to subtract Aay and Abj 
within the system. If these error quantities are known, they can be subtracted. Assuming 
that the two optical devices in the M-V processor representing ay and bj are well 
characterized, the grayscale responses for all the components can be computed, and their 
responses compensated. For example, if the number for ay is set to 50, as above, then ay- 
converts to 48.5 as a result of the Aay error. This error can be adjusted, assuming 
linearity and ideal control. If 51.6129 is instead input, the ay component converts to 50, 
the desired value. The conversion to obtain 50 is shown in equation (4). 

Q - 5Q -°- Q2 xl00 = 51.6129 (4) 
0.95-0.02 

In principle, such error compensation could be performed by using digital 
electrical processing prior to the optical analog processing. However, a large amount of 
processing is necessary to compensate for all of the components in a modern optical M-V 
processor to increase the bit-width of the answer. Such an amount of digital electrical 
processing defeats the performance advantage of the optical processor. Thus, what is 
needed is a method for improving the bit-width (i.e., the precision) of the optical 
processor without increasing the overall processing time. 
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SUMMARY OF THE INVENTION 

The invention uses irregular optical interconnections to compensate for device 
non-uniformity in analog optical processors, such as matrix- vector (M-V) optical 
processors. An M-V processor of one embodiment includes two optical devices, such as 
5 spatial light modulators (SLM's). One of the devices represents a matrix, and the other 
device represents a vector. Typically, this other device represents a given vector in 
multiple locations. Each device has non-uniformities. The non-uniformities of the 
devices are substantially matched to one another, by matching grayscale responses of the 
device with one another. Due to the randomness of the non-uniformities, the most 

1 0 optimal grouping of matched devices uses optical interconnections between the devices 
that are irregular. Light traveling through the devices represents the interim calculations 
for the product of the matrix and the vector. When the light is fanned into the detectors 
(by an irregular optical interconnection), which is a summation process, the analog part 
of the processor has computed each of the components for the resultant vector. An 

15 analog-to-digital (A/D) processor, or converter, after each detector is used to generate the 
digital component of the resultant vector with substantial reduction in the errors. 

It can be seen that a uniform set of devices can lead to an improvement in the bit- 
width. For this method, uniform generally means that for any sub-device utilized in the 
computation of the i th component of vector c, (c v ... 9 c n ... 9 c N ) , the off state and on state 

20 characteristics are the same for all of these sub-devices. For example, let all sub-devices 
representing ay and bj, bf , have a grayscale off state of g 0 ff. For any of the light beams 
arriving at the i th detector of the detector array, the grayscale multiplication is (g off ) . 
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Collectively, the N light beams have a gray scale of N ■ {g ojf f . Similarly, the highest 

M-V multiplication has a gray scale of N • (g on f . 

The A/D converter is constructed to range between both of these grayscale 
extremes. For the i th component of vector c, the lowest number, N 2 • (cf # ) 2 , is assigned 
to the grayscale of N • (g on f . The value of having identical sub-devices is that an 
increase of any number An represented by a sub-device will have the appropriate 
grayscale increase, Ag, for that sub-device and this grayscale increase, Ag, will be the 
same for all of the sub-devices. However, this grayscale increase, Ag, will not be the 
same for all of the sub-devices unless there is uniformity for the 2N sub-devices utilized 
in the computation for the i th component of vector c. Numerical simulations of the 
optical processor should be used to determine the processor bit-width as a function of 
sub-device uniformity. 

The invention provides for advantages over the prior art. In the prior art, the non- 
uniformities in the system imply that there are errors in the representations of the 
numbers. After the analog optical processing has been performed, an A/D converter 
generates the numbers from their analog representation format. Because the 
representations are not the same, an A/D converter cannot convert the analog 
representation to digital with a high bit-width. The invention can group devices that are 
more uniform with each other, such that a more well characterized A/D converter can be 
constructed. Thus, a higher bit width is obtained. The A/D converters operate in parallel 
so that the same amount of time is required, as in the prior art, for the A/D conversion 
process. Thus, the invention achieves an increase in the bit-width of matrix-vector 
computations more advantageously than in the prior art without increasing the overall 
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processing time. Other advantages, aspects, and embodiments, of the invention will 
become apparent by studying the detailed description that follows and by referencing the 
accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a diagram of a matrix-vector (M-V) analog optical processor according 
to an embodiment of the invention. 

FIG. 2 is a diagram showing an optical device, such as that which may be used to 
represent the matrix and/or the vector in multiple locations in the processor of FIG. 1, in 
more detail, according to an embodiment of the invention. 

FIG. 3 is a diagram showing more specifically how the processor of FIG. 1 
achieves irregular optical interconnections to compensate for non-uniformities within the 
optical devices of the processor, according to an embodiment of the invention. 

FIG. 4 is a diagram showing how one embodiment of the invention achieves the 
irregular optical interconnections of FIG. 3 using prisms. 

FIGs. 5, 6, and 7 are diagrams showing the optics of using prisms to achieve 
irregular optical interconnections as in FIG. 4. 

FIG. 8 is a flowchart of a method according to an embodiment of the invention. 

DETAILED DESCRIPTION OF THE INVENTION 

In the following detailed description of exemplary embodiments of the invention, 
reference is made to the accompanying drawings that form a part hereof, and in which is 
shown by way of illustration specific exemplary embodiments in which the invention 
may be practiced. These embodiments are described in sufficient detail to enable those 



skilled in the art to practice the invention. Other embodiments may be utilized, and 
logical, mechanical, electrical, and other changes may be made without departing from 
the spirit or scope of the present invention. The following detailed description is, 
therefore, not to be taken in a limiting sense, and the scope of the present invention is 
defined only by the appended claims. 

FIG. 1 shows an optical analog processor 100 according to an embodiment of the 
invention, which is more specifically a matrix-vector processor. The processor 100 
includes light sources 102, a first optical device 104, a second optical device 106, 
detectors 108, analog-to-digital (A/D) converters 110, and drivers 112 and 114. The light 
sources 102 provides beams of light within the processor 100, for traveling through the 
optical devices 104 and 106, and then for detection by the detectors 108. The light 
sources 102 may be, for example, vertical-cavity surface-emitting lasers, or another type 
of light source. 

Each of the first and the second optical devices 104 and 106 can be a spatial light 
modulator (SLM), or another type of optical device. The first optical device 104 
specifically represents a vector, and the second optical device 106 represents a matrix. 
The devices 104 and 106 have coupled thereto electrical drivers 1 12 and 1 14, 
respectively, that power the devices 104 and 106. The devices 104 and 106 have non- 
uniformities that affect their representation of the vector and the matrix, respectively. To 
compensate for these non-uniformities, the non-uniformities of the device 104 are 
matched to the non-uniformities of the device 106, such that the optical interconnections 
between the devices 104 and 106 are irregular. That is, the grayscale responses of the 
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devices 104 and 106 are matched to one another. This is described in more detail later in 
the detailed description. 

Light traveling through the first device 104 and then through the second device 
106 thus represents the matrix-vector product, when summed at the detectors 108, of the 
5 vector and the matrix represented by the devices 1 04 and 1 06, respectively. The 
detectors 108 detect such light, and specifically may be an array of detectors. The 
detectors 108 convert the light detected into electrical energy, which is converted from 
analog to digital by the A/D converters 110, which may specifically be an array of 
converters. The A/D converters 110 therefore provide the output 116 of the processor 
10 100, representing the matrix-vector product of the vector and the matrix represented by 
the devices 104 and 106, respectively. The A/D converters 110 will subtract to a larger 
degree any errors in the matrix-vector product that remain even after substantially 
matching of the non-uniformities of the devices 104 and 106, by non-digital electrical 
|1| processing. 

ffl 1 5 FIG. 2 shows in more detail an optical device 202 that may act as one of the 

M optical devices 104 and 106 of FIG. 1. The optical device 202 is divided into sub-devices 

organized into columns 204 and rows 206, such as the sub-device 208. The optical 
device 202 is specifically a 4x4 device, because there are four rows and four columns of 
sub-devices, although the invention is generally applicable to any MxP device. The 
20 grayscale response of each sub-device is different, such that the non-uniformities of each 
sub-device are different. 

The non-uniformities of each sub-device are characterized, and the sub-devices 
are organized into four groups, with each group including four sub-devices. Each group 
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represents a component of the matrix, a ij? or the vector, b j? depending on whether the 
optical device 202 is representing the matrix or the vector, where the sub-devices of each 
group represent the grayscale response of a component. Each group is organized to 
optimize the accuracy of a corresponding component of the matrix-vector product, cj, 
based on the characteristics of the grayscale response or non-uniformities of the sub- 
devices within each group. As shown in FIG. 2, shaded sub-devices 210, 212, 214, and 
216 represent one such group. 

More specifically, the characterizing and matching of the non-uniformities of the 
sub-devices within each group allows for efficient subtraction of the errors Aa a and Abj 
within the processor 100. Because the values Aa^ and Abj are known and grouped by 
similarity for each i th -component in vector c, they can subsequently be efficiently 
subtracted at the A/D converters 110 (not shown in FIG. 2) for each i th -detector by non- 
digital electrical or photonic processing. Because processing is performed in parallel at 
the detectors 108 (also not shown in FIG. 2), the maintains the same amount of 
processing time as the prior art needs, but increases the bit-width of the computed 
answer. 

FIG. 3 shows how the sub-devices of the optical devices 102 and 104 of the 
processor 100 are optically interconnected in an irregular manner to achieve increased 
bit-width of the resulting matrix- vector product. The optical device 102 representing the 
input vector has sub-devices 302, 304, 306, and 308 grouped together to represent the 
input vector, (bi, b 2 , . . b N ). Similarly, the optical device 104 representing the matrix 
has sub-devices 302', 304', 306', and 308' grouped together to represent one component 
of the matrix, ay. The non-uniformities, or grayscale response, of the sub-devices 302, 
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304, 306, and 308 of the device 102 are specifically matched to those of the sub-devices 
302', 304', 306', and 308' of the device 104 to minimize errors in the resulting matrix- 
vector product computation. 

The light beams traveling through the sub-device 302, 304, 306, and 308, and 

5 then through the sub-devices 302', 304', 306', and 308' are along irregular optical 

interconnections between the devices 102 and 104. An irregular optical interconnection 
generally is one in which light is specifically directed from one particular position to 
another particular position, such that the interconnections lack symmetry. By 
comparison, a regular optical interconnection is .generally one in which the light beams 

1 0 display symmetry. For example, the light paths may be parallel to one another, or they 
may be diffracted from a single point source such that they fan out in a regular manner. 
Due to the randomness of the non-uniformities of the devices, optimal interconnections 
will with high probability need irregular interconnections. 

The irregular optical interconnections between the devices 102 and 104 in FIG. 3 

1 5 thus are such that one light beam travels on a path from the sub-device 302 to the sub- 
device 302', which is not a regular path as compared to the light beam traveling on a path 
from the sub-device 304 to the sub-device 304', and so on. After traveling through the 
optical devices 102 and 104, the sum of the individual beams of light represent the i - 
component of the matrix-vector product, Cj, and are interconnected on the detector 310, 

20 where the detector 1 08 is actually an array of detectors including the detector 310. Such 
interconnection is also accomplished by an irregular interconnection scheme described 
later, or by another manner. 
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The detectors 108 converts the detected light to electrical energy, which is then 
passed along to the A/D converters 1 10 to convert the energy from analog to digital, such 
that the output 1 16 represents the matrix-vector product of the vector represented by the 
device 102 and the matrix represented by the device 104. As has been indicated, the 
converters 1 10 will subtract to a larger degree any errors that remain in the matrix-vector 
product after the non-uniformities of the device 102 have been matched to the non- 
uniformities of the device 104. 

FIG. 4 shows how the irregular optical interconnections between the optical 
devices 102 and 104 can be achieved by using prisms. Between the devices 102 and 104 
of the processor 100 are prism arrays 402 and 404, including prisms such as the prisms 
406 and 408, respectively. The irregular optical connection between sub-device 302 of 
the device 102 and the sub-device 302' of the device 104 is specifically shown. The 
prism 406 redirects the light beam emanating through the sub-device 302 to the prism 
408. Similarly, the prism 408 redirects the light beam emanating through the prism 406 
to the sub-device 302'. Other devices other than prisms, can also be used. 

FIGs. 5, 6, and 7 show the optics of the prisms 406 and 408 of FIG. 4. The prisms 
406 and 408 are identically constructed, but oriented differently in their respective prism 
arrays 402 and 404 (not shown in FIGs. 5, 6, and 7). By rotating the prism through an 
angle a, as indicated by the reference number 602 in FIG. 6, the exiting light from a 
prism sweeps out a cone shape, if the input light beam is normal as is shown for the prism 
406 in FIG. 5. The prism wedge angle 0, indicated by the reference number 502 in FIGs. 
5 and 7, determines the deflection of light angle <|>, indicated by the reference number 504 
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in FIG. 5, from the exiting normal face of the prism. The angle <j) determines the cone 
angle. 

Thus, the prism 406 at any position in the prism array 402 can potentially be made 
to direct the light beam to a position anywhere in the prism array 404. The prism 408 in 
the prism array 404 at the position of interest can change the direction of the light beam 
so that it exits the prism 408 normal to its exiting face, if the prism 408 is so 
appropriately oriented. The exit light beam of the prism 408 is thus parallel to the input 
light beam of the prism 406, and is appropriately translated to achieve the irregular 
optical connection. 

FIG. 8 outlines a method 800 for compensating for non-uniformities within the 
optical devices of an analog optical processor, such as a matrix-vector processor, using 
irregular optical connections. The non-uniformities of the optical devices are first 
characterized, such as by characterizing the grayscale responses of the devices (802). 
The non-uniformities are at least substantially matched so that the error in the product 
that will result between the mathematical constructs represented by the optical devices is 
minimized, such as by matching the grayscale responses of the devices (804). The term 
mathematical constructs is a general term meant to encompass input vectors, matrices, 
and so on. Based on the matching of the non-uniformities of the devices, the devices are 
optically interconnected with each other in an irregular manner (806). This may include 
using prisms, as has been described. 

Beams of light are then provided that travel through both devices along the 
irregular optical interconnections to achieve the product of the mathematical constructs 
represented by the two optical devices (808). In the case of a matrix-vector processor 
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where one optical device represents an input vector and the other optical device 
represents a matrix, this product is specifically a matrix-vector product. The multiple 
light beams representing the product, after having traveled through both of their 
respective devices, are appropriately summed and detected to measure the product as 
electrical energy (810), such as by a detector array. Any errors that remain in the 
product, even after matching the non-uniformities has occurred, can be subtracted by 
non-digital electrical processing of an A/D converter (812), which also converts the 
resulting product from analog to digital. The output of the A/D converter thus represents 
the product of the mathematical constructs represented by the two optical devices. 

It is noted that, although specific embodiments have been illustrated and 
described herein, it will be appreciated by those of ordinary skill in the art that any 
arrangement is calculated to achieve the same purpose may be substituted for the specific 
embodiments shown. This application is intended to cover any adaptations or variations 
of the present invention. For example, where the invention has largely been described in 
the context of an analog optical processor that is a matrix-vector processor, the invention 
is applicable to other types of analog optical processors as well. Therefore, it is 
manifestly intended that this invention be limited only by the claims and equivalents 
thereof. 



14 



